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Agriculture is a crucial element to build a strong economy, not only because 
of its importance in providing food, but also as a source of raw materials for 
industry as well as source of energy. Different diseases affect plants, which 
leads to decrease in productivity. In recent years, developments in 
computing technology and machine-learning algorithms (such as deep neural 
networks) in the field of agriculture have played a great role to face this 
problem by building early detection tools. In this paper, we propose an 
automatic plant disease classification based on a low complexity 
convolutional neural network (CNN) architecture, which leads to faster on- 
line classification. For the training process, we used more than one 57,000 
tomato leaf images representing nine classes, taken under natural 
environment, and considered during training without background 
subtraction. The designed model achieves 97.04% classification accuracy 


classification and less than 0.2 error, which shows a high accuracy in distinguishing a 
disease from another. 
This is an open access article under the CC BY-SA license. 
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1. INTRODUCTION 

Around 2.3 billion people in the world were moderately or severely food insecure in 2021, or nearly 
30% of the global population -more than 350 million more people than in 2019, the year before the 
coronavirus disease of 2019 (COVID-19) pandemic unfolded. They are facing chronic hunger and most of 
them, 795 milions, live in Africa [1]. This continent is thus categorized as a food deficit nation. Thus, 
agriculture is the backbone of developing countries, it is an essential source of income that ensures the 
economic growth and food security, not only because it provides food for our daily life, but also, because of 
the dependence of all the industries on agriculture both directly and indirectly. Furthermore, agriculture is a 
major source of employment for people to face hunger, so it is crucial to develop the agriculture sector in any 
country. Agriculture always faces loss of crops caused by natural conditions, seeds quality and price in the 
international market, nutrition, or diseases. 

Tomato is a popular vegetable crop in all countries. Under undesirable conditions: light, 
temperature, water, seeds quality and nutrition, this crop can be affected by a lot of pathogens like bacterial, 
fungal, viral, insects and nutrition deficiencies problems. These diseases are easily spreadable which causes 
big damages in quality and quantity. Some diseases have not visible symptoms and are identified only 
through sophisticated laboratory analysis, which may be too late to prevent any possible damage in crops. On 
the other hand, most of these diseases have visible symptoms that can be detected by the expert (plant 
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pathologists) in early stages through visual perception, however, continuous monitoring costs highly in the 
case of large cultivated surfaces. 

The great development and advances in computer vision and neural networks leads to design a 
technological support (automated system) for plant diseases identification, standing by their visual 
symptoms, and this will help simple farmers to achieve an accurate plant diseases diagnostics. In [2], two 
algorithms were used, the first employs visual geometry group (VGG16) as a feature extractor with support 
vector machine (SVM) as a classifier, the second is the original VGG16 model fine-tuned, to build a 
classification model. They found that the second model gives better result than the first with a classification 
accuracy of 89%. Ferentinos [3], authors used 87,848 photographs, captured in the fields. These photographs 
contain 58 distinct classes and were employed to train: AlexNet, AlexNetOWTBn, GoogLeNet, and VGG, 
the best result achieved is 99.53% accuracy (0.47% error), by VGG model. 

Picon et al. [4] shows a new approach to detect three fungual diseases (septoria, tan spot and rust), 
on wheat images. Based on the previous work by Johannes ef al. 2017, they develop a new algorithm with an 
adapted deep residual neural network, in the aime of detecting these three diseases in real time. The results 
show that the best result was obtained by the new approach: 87% of accuracy compared by the previous work 
of Johannes et al. which was 78%. 

Ouhami et al. [5] compare three transfer learning models: dense convolutional network (DenseNet), 
(161 and 121 layers) and VGG16 networks. Their study was based on 666 infected plant leaves images, 
captured in the fields in Morocco, in addition to the images collected from the internet; all of them were 
categorized in 6 classes. The best result was 95.65%, obtained by DensNet161. Agarwal et al. [6] designed a 
convolutional neural network (CNN) based on three convolutional-maxpooling layers followed by two fully 
connected layers, for training process, they used 50,000 images (plant village) for 10 classes, the final results 
show the high performance of the designed model over VGG16, InceptionV3 and MobileNet pretrained 
models, with an accuracy of 91.2%. Morgan et al. [7], they used artificial neural network (ANN), Naive 
Bayes, k-nearest neighbor (KNN), support vector machine (SVM), decision tree and random forest, to 
classify and predict soybean and mushroom diseases. In mushroom dataset, 8,124 hypothetical samples of 23 
species were used to determine if there is a disease or not, the result achieved using all algorithms except 
Naive Bayes, is 100%. In soybean dataset, 307 observations with 19 different diseases were used to 
determine which disease was present; they found that the ANN and KNN classifiers give the best result, 
which is more than 91% of accuracy. 

Zaki et al. [8], 4,621 tomato leaf images obtained from PlantVillage dataset was employed to fine 
tune MobileNet V2 model parameters; and classify four diseases classes, the experimental results show an 
accurate classification performance, with more than 90% of accuracy. Sladojevic et al. [9], the CNN 
approach was used to classify 13 different types of plant diseases as well as distinguish the leaves of the plant 
from their surroundings. The experimental results achieve an average accuracy of 86.3%. Aquil and Ishak 
[10] used CNN method to classify 54,306 images of 26 diseases across 14 species. The dataset was used to 
train different CNN models (DenseNet 120, ResNet (101, 50, 30, 18), SqueezeNet and VGGNet (19, 16), the 
best result achieved is 99.68% accuracy, by DenseNet 120. Kawasaki et al. [11], they used 800 leaf images to 
classify two viral diseases that affect cucumber plant: melon yellow spot virus (MYSV) and zucchini yellow 
mosaic virus (ZYMV). The proposed model achieved an average accuracy of 94.9%. Via tansfer learning 
with various architectures, [8], [12]—[24], demonstrate that the pre-trained networks pove a strong ability to 
generalize to new situations (images of plant leaf diseases), by fine-tuning, the pre-existing models. 

Given the enormous complexity of the network structures used in the previously cited works, the 
proposed structure is optimally designed to have the least complex model in terms of network depth (number 
of layers) [25], [26]. This optimization is not only used to achieve computational efficiency, but also 
improves the generalizability to classify the diseases. Thus, the main contributions to this work are: i) an 
automatic tomato plant diseases classification method based on a low complexity CNN architecture, which 
leads to faster on-line classification; ii) the importance of data augmentation procedure, to avoid overfitting 
and achieving better results; iii) the influence of hyperparameters to accelerate learning and get a stable 
model; and iv) the results show a high accuracy in distinguishing a disease from another. 


2. METHOD AND MATERIAL 
2.1. CNN a theoretical background 

The advances in computer vision and machine learning has been constructed and perfected with time, 
one of the methods that has shown an excellent performance is the CNN method. CNNs are regularized 
versions of multilayer perceptron (MLP) which is in turn a class of feedforward ANN [27]. In CNN 
architecture, the vector of weights and bias is called filter, each filter represents a particular feature of the 
input (e.g., shape in an image). 
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Figure | illustrates how a CNN runs layer by layer in a forward pass, where x? is the image (order 3 
tensor); w! parameters involved in the i*"layer; x! the output of the i‘’-layer, which also acts as the input to 
the i‘layer; and w” loss layer. The parameters of the CNN are optimized to minimize the loss z. To get the 
loss we need to compare the CNN model prediction x“ with the ground truth labels or target corresponding 
tox?. On another hand and when the loss z is achieved, we use this supervision signal to update the 
parameters of the model such (1): 


iyl+1 — (il _ » 92 
(wy = (wi) () 


Where: 77 is the learning rate and / is iteration number, i is the layer index [21], [28]. 
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Figure 1. CNN structure, running in a layer by layer; in a forward pass 


“In standard gradient descent algorithm, the parameters of the model are updated using one gradient 
calculated using one training example. The stochastic gradient descent (SGD) algorithm evaluates the 
gradient and updates the parameters using a subset of the training set, this subset is called mini-batch where 
every gradient evaluation is an iteration, each iteration is one step for minimizing the loss function, and an 
epoch is the full pass of the training algorithm over all the training data set using mini-batches” [29], [30]. 
Note that when we use mini-batch strategy; the CNN input becomes an order 4 tensor (H, W, D, mini-batch 
size). The stochastic gradient descent algorithm oscillates towards the optimum. We use momentum method 
to help accelerating the gradient in the right direction and reducing oscillations, by adding a fraction 7 of the 
update vector of the past time step to the current update vector: 


Cw)? = (wy! — nVz((w'y') + yw)! — (w')) (2) 


Where I is the iteration number,y > 0 is the learning rate,w is the parameter vector, and z(w) is the loss 
function. The gradient of the loss function is Vz(w). y determines the contribution of the previous gradient 
step to the current iteration [31], [32]. 


2.2. Hardware and software environment 

The training step requires graphics processing units (GPUs) because of the huge amount of 
computing. The deep learning was run under an Intel Core i7-7700 CPU @ 3.60GHz system, with the 
graphic card NVIDIA GeForce GTX 1080 Ti, powered by its GPU. This helped significantly to reduce the 
execution time, and allowed running many simulation scenarios. 


2.2.1. Dataset 

Besides the data downloaded from the internet (searched in terms of two keywords: the disease and 
plant name), the tomato leaf disease images have been taken from the green houses of the CRESTRA 
research center in Biskra-Algeria; where diseases are induced on tomato plant to be used later for biological 
experiments. Finally, 1210 images, in red-green-blue (RGB) color space, joint photographic experts’ group 
(JPEG) standard format, for 9 classes shows in Figure 2, Table 1 were obtained, where each class is a disease 
label. Any tomato leaf image was captured in its own environment, i.e., with a random background. 


2.2.2. Image preprocessing 

It’s important to utilize accurately classified images. We asked an agriculture-expert to examine the 
leaf images and label them with the appropriate disease acronym. Only higher resolution images with region 
of interest and containing all the needed information for feature learning were considered, they were resized 
to 256x256 to make feasible the training and reduce its time. 


2.2.3. Augmentation process 

We apply augmentation to reduce overfitting during training. It’s about enlarging the data set by 
producing a slight distortion (transformation) to the images: rotation, translation, shearing, scaling and 
reflection [33]. Thus, we obtain the longer database Table 1. 
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Figure 2. Data-base classes. (upper left-to-lower right): canker, early blight, late blight, magnesium 
deficiency, healthy, leaf mould, leaf miner; septoria leaf spot, powdery mildew 


Tablel. Overview on the plant-diseases database [34] 


Class Number of Total number Leaf symptoms 
original images of images 

Early blight 165 8244 Round small dark leaf spots (can be mor than % inch in diameter) with 
yellow tissue around the spots. The spots appear as concentric rings. 

Late blight 82 5743 Dry dark brown and irregular spots, surrounded by green edge. 

Septoria leaf spot 163 8172 Numerious dark brown circular spots that are the size of a pencil tip or 
larger. We can see in the center black specks. 

Leaf mould 124 7256 Greenish to yellow spots (less than % inch) on the upper leaf surface, and 
olive-green spots on the leaves upper side. 

Leaf miner 149 1094 Irregular lines or tunnels on both sides of the leaf, inside these tunnels we 
can find a remarkable blotch looks like a black fecal material. 

Powdery mildew 278 9667 We see powdery spots on the upper and lower sides of the leaf, in 
addition to yellow spots. 

Bacterial canker 79 5574 At the margin of leaves, appear dark brown lesions, we can find also a 
round spots. 

Magnesium deficiency 100 6307 Generally, plants develop an interveinal chlorosis on tomato plant leaves 

Healthy 70 5148 - 


2.2.4. Layer configuration and parameters of CNN 

The proposed model consists of three convolution-maxpooling layers, the three last layers are: fully 
connected layer followed by softmax and classification layers. The model is trained, in Matlab environment, 
using SGD method with a constant learning rate equal to 0.0007. The network parameters were modified by 
trial and error. The database was divided into: 80% for training and 20% for validation. 


3. RESULT AND DISCUSSION 
3.1. Data augmentation 

We showed earlier the importance of data augmentation and its influence to avoid overfitting which 
is one of the current problems that occur during learning. In Figure 3, classification accuracy and loss on the 
training dataset is marked in blue and red for the training dataset; and in black dotted line for the validation 
dataset. When we use a small dataset (1,210 images), we notice that the accuracy or the error curves obtained 
on the training set, goes twards 100% of accuracy (or 0.001 error), while the accuracy or error curves 
obtained on validation, stops improving after a certain number of epochs and begins to be stable twards 58%. 
This shows that the model works more poorly as Figure 3(a), this is explained by the fact that the training 
examples were memorized by the network, but it has not the ability to generalize to new cases. 

Several ways are used to reduce overfitting, the best of them is to get more training data. When we 
use a big dataset (57,205 images), that was obtained by applying image data augmentation procedure, we 
noticed that the model fits well on training dataset and generalizes well to the validation dataset with a high 
validation accuracy, 97.04%, and a very small error shows in Figure 3(b). In this case we have a good fit 
model. 
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Figure 3. Accuracy and loss of classification (a) without augmentation and (b) with augmentation process 
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3.2. Momentum (7) 

Here, we show the influence of momentum on the model performance. In order to gain time, we just 
used four diseases classification model (powdery mildew, early blight, septoria and leaf mould). In Figure 4 
we show line plots for different momentum (y) values, and compare their effect to the SGD algorithm, with a 
constant learning rate. Classification accuracy and loss on the training dataset are marked respectively in blue 
and red, whereas accuracy and loss on the validation dataset are marked in black dotted line. The proposed 
method shows the ability to learn very well with all momentum values (y=0, y=0.5, y=0.75, y=0.9), which 
explains the stability of the model. On the other hand, we see that adding momentum can accelerate training 
and gives a more stable model. 


sh i SST Sy 


cncey ——Taingsntoted) — Tag ~ # Vat 
Accuracy —!—Training{smoothed) 


Training ~ © ~ Validation 


Accuracy (%) 


10 20 0 0 


0 100 200 300 400 500 600 700 a ‘0 mm so % am a0 1% 
Iteration teraton 


Liss —— Tenses} — Tang ~ # ldion 


le Loss —'—Traringlsmoothed) —— Training — @ ~ Validation | 


“sell a! whl, 
os} { 


NY AV I | 

RAP HL nl | | 
"y al! ni) } ’ | | | | iI | | 
Me Wu a 


0 100 200 


400 500 600 700 a 100 mm 0 4 say a0 ™ 
Iteration ieraton 


1 — oie 
ee ee ee ee ee eo 0-6 0-0 0-9 0-0 0-0 0-0 0-0 0-0 0-0 0-08 06 oe 


Accuracy —*— Training/smoothed) ‘Training — @ ~ Validation 90 


Accuracy —!—Traning(smoothed) —— Training — @ Validation 


Accuracy (%) 
Accuracy (%) 


40 


10- 

10 20 10 20 
0 i Ll L i 

200 300 400 500 600 700 0 400 200 300 400 500 600 700 


Iteration Iteration 


Loss —'—Training(smoothes) —— Training — & —Validation Loge 9 Trening (smoothed), = Traking. — @ = Vek 


Win, toe *-e #-e 0 fe 0-2 0-0 0-0 0-0 0 0-0 0-6 0-0 0-0 -970-— 
na ! 


0 4100 200 300 400 500 600 700 0 400 200 300 400 500 600 700 
Iteration Iteration 


Figure 4. Accuracy and loss of classification with different momentum values, (upper left-to-lower right): 
y=0; y=0.5; y=0.75; y=0.9 


3.3. Learning rate (n) 

The learning rate is a configurable hyperparameter which refers to the amount that the weights are 
updated during training. Figure 5 shows line plots for different learning rate values (7), and compares its effect 
on the SGD algorithm, with a momentum=0.9. Classification accuracy and loss on the training dataset are 
marked in blue and red, whereas accuracy and loss on the validation dataset are marked in black dotted line. 
We can see for 7 = 7:107*, and n = 7-107’, oscillations, and inability of the model to learn; for 7 = 7° 
1073, Hn=7- 10-*, and n=7- 10-5, less oscillations, and ability of the model to learn well. The best 
accuracy and loss obtained with n = 7: 10~*. Decreasing the value of 7, leads to a slow model. 
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Figure 5. Accuracy and loss of classification with different learning rate values, (upper left-to-lower right): 
q= 7107.4 = 7107? 4 = 7710" 4 = 710° S710 


4. CONCLUSION 


In this study we tried to focus on tomato plant diseases detection and classification using leaf images 
which are captured in the field, or downloaded from the internet, in their environment without background 
subtraction. The proposed CNN architecture involves only three convolutional layers, which allows a 
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reduced computational complexity, and this is very useful for applications that require fast on-line decision 
making. Thus, we achieved 97.04% accuracy and less than 0.2 error, by this we meant that one should not be 
impressed when a complex model fits a dataset well, with enough parameters we can fit any dataset, and it 
will be easy to reach ~99% accuracy by a continuous tuning process. The first part of experiments shows the 
importance of data augmentation procedure to avoid overfitting and achieve better results. In the second part 
of this study, we improved: The considerable effect of learning rate to the stochastic gradient descent 
algorithm, this hyperparameter should be the first to adjust; and the importance of using SGD algorithm with 
momentum to accelerate learning and get a more stable model. 
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